
[GLUTEN-10933][VL] BroadcastExchange outputs cudf::table #11441

Merged
jinchengchenghh merged 4 commits into apache:main from jinchengchenghh:fix_bhj on Jan 28, 2026

@jinchengchenghh jinchengchenghh commented Jan 19, 2026

This is a follow-up to #11090. A broadcast table is usually small, so as a first step we do not serialize it to the cudf format; we simply convert it to a cudf table. That conversion may be expensive for the string data type.
A possible later optimization is to serialize to the cudf format and deserialize to a cudf::table the way the hash shuffle reader does, copying the buffer during deserialization.

Tested locally with TPC-DS Q95.

Related issue: #10933

// Need to allocate null buffer.
if (returnBuffers[bufferIdx] == nullptr) {
  std::shared_ptr<arrow::Buffer> buffer;
  GLUTEN_ASSIGN_OR_THROW(buffer, arrow::AllocateResizableBuffer(bufferSizes[bufferIdx], pool));
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should this be arrow::bit_util::BytesForBits(bufferSizes[bufferIdx])?
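
For context on the question above, here is a minimal sketch of the bits-to-bytes rounding that arrow::bit_util::BytesForBits performs. The helpers bytesForBits and makeAllValidBitmap are hypothetical stand-ins written for illustration and are not code from this PR. Whether bufferSizes[bufferIdx] holds a bit count or a byte count is not visible in the hunk, and filling the bitmap with "all valid" bits is an assumption of this sketch.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in mirroring what arrow::bit_util::BytesForBits computes:
// the number of bytes needed to store `bits` bits, rounded up.
constexpr int64_t bytesForBits(int64_t bits) {
  return (bits + 7) / 8;
}

// Hypothetical helper: build a validity bitmap for `numRows` rows.
// Assumption: every row is valid, so each byte is filled with 0xFF
// (Arrow validity bitmaps use 1 = valid, 0 = null).
inline std::vector<uint8_t> makeAllValidBitmap(int64_t numRows) {
  const auto numBytes = static_cast<std::size_t>(bytesForBits(numRows));
  return std::vector<uint8_t>(numBytes, 0xFF);
}
```

With these helpers, a 10-row column needs a 2-byte bitmap, not a 10-byte one. If the size passed to the allocator is already expressed in bytes, no conversion is needed.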

@marin-ma marin-ma left a comment

LGTM. Thanks!

@jinchengchenghh jinchengchenghh changed the title BroadcastExchange outputs cudf::table [GLUTEN-10933][VL] BroadcastExchange outputs cudf::table Jan 28, 2026
@jinchengchenghh jinchengchenghh merged commit 55a2332 into apache:main Jan 28, 2026
106 of 107 checks passed